CrystalBall : A Framework for Mining Variants of Association Rules

نویسندگان

  • Kok-Leong Ong
  • Wee Keong Ng
  • Ee-Peng Lim
چکیده

The mining of informative rules calls for methods that include different attributes (e.g., weights, quantities, multipleconcepts) suitable for the context of the problem to be analyzed. Previous studies have focused on algorithms that considered individual attributes but ignored the information gain in each rule when the interaction of two or more attributes are taken into account. Motivated by the above, we developed a framework called CrystalBall that supports declarative mining of different rules (i.e., variants) involving several attributes. It eliminates the time and cost of engineering algorithms as practiced in previous studies, and introduces a foundation for cross-variant enhancements. The framework consists of a generic rule mining engine (VI), and a variant description language (VDL) for defining attribute-specific behavior. Besides demonstrating the flexibility of the framework, we also discuss the experimental studies, the limitations of the framework, as well as future work in the paper.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mining Variants of Rules Using the CrystalBall Framework

The mining of informative rules calls for methods that include different parameters (e.g., weights, quantities, multipleconcepts) suitable for the context of the problem to be analyzed. Previous studies have focused on algorithms that considered individual parameters but ignored the information gain in each rule when the interaction of two or more parameters are taken into account. Motivated by...

متن کامل

Retaining Customers Using Clustering and Association Rules in Insurance Industry: A Case Study

This study clusters customers and finds the characteristics of different groups in a life insurance company in order to find a way for prediction of customer behavior based on payment. The approach is to use clustering and association rules based on CRISP-DM methodology in data mining. The researcher could classify customers of each policy in three different clusters, using association rules. A...

متن کامل

A Framework for Efficient Scalable Mining of Rule Variants

Association rule mining is an important data mining problem. Since its inception, different variants of rules has been proposed in the literature. In each case, different attributes (e.g., weight and quantity) are considered to obtain more informative rules. To our knowledge, each proposal is based on the Apriori algorithm that is, in modern context, inefficient. Methods that outperform the Apr...

متن کامل

Introducing an algorithm for use to hide sensitive association rules through perturb technique

Due to the rapid growth of data mining technology, obtaining private data on users through this technology becomes easier. Association Rules Mining is one of the data mining techniques to extract useful patterns in the form of association rules. One of the main problems in applying this technique on databases is the disclosure of sensitive data by endangering security and privacy. Hiding the as...

متن کامل

Using a Data Mining Tool and FP-Growth Algorithm Application for Extraction of the Rules in two Different Dataset (TECHNICAL NOTE)

In this paper, we want to improve association rules in order to be used in recommenders. Recommender systems present a method to create the personalized offers. One of the most important types of recommender systems is the collaborative filtering that deals with data mining in user information and offering them the appropriate item. Among the data mining methods, finding frequent item sets and ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003